Finding Maximum Colorful Subtrees in Practice
نویسندگان
چکیده
In metabolomics and other fields dealing with small compounds, mass spectrometry is applied as a sensitive high-throughput technique. Recently, fragmentation trees have been proposed to automatically analyze the fragmentation mass spectra recorded by such instruments. Computationally, this leads to the problem of finding a maximum weight subtree in an edge-weighted and vertex-colored graph, such that every color appears, at most once in the solution. We introduce new heuristics and an exact algorithm for this Maximum Colorful Subtree problem and evaluate them against existing algorithms on real-world and artificial datasets. Our tree completion heuristic consistently scores better than other heuristics, while the integer programming-based algorithm produces optimal trees with modest running times. Our fast and accurate heuristic can help determine molecular formulas based on fragmentation trees. On the other hand, optimal trees from the integer linear program are useful if structure is relevant, for example for tree alignments.
منابع مشابه
Speedy Colorful Subtrees
Fragmentation trees are a technique for identifying molecular formulas and deriving some chemical properties of metabolites small organic molecules solely from mass spectral data. Computing these trees involves nding exact solutions to the NP-hard Maximum Colorful Subtree problem. Existing solvers struggle to solve the large instances involved fast enough to keep up with instrument throughput, ...
متن کاملComplexity of Finding Nearest Colorful Polytopes
Let P1, . . . , Pd+1 ⊂ R be point sets whose convex hulls each contain the origin. Each set represents a color class. The Colorful Carathéodory theorem guarantees the existence of a colorful choice, i.e., a set that contains exactly one point from each color class, whose convex hull also contains the origin. The computational complexity of finding such a colorful choice is still unknown. We stu...
متن کاملEnumeration of subtrees of trees
Let T be a weighted tree. The weight of a subtree T1 of T is defined as the product of weights of vertices and edges of T1. We obtain a linear-time algorithm to count the sum of weights of subtrees of T . As applications, we characterize the tree with the diameter at least d, which has the maximum number of subtrees, and we characterize the tree with the maximum degree at least ∆, which has the...
متن کاملThe Union of Colorful Simplices Spanned by a Colored Point Set
A simplex spanned by a colored point set in Euclidean d-space is colorful if all vertices have distinct colors. The union of all full-dimensional colorful simplices spanned by a colored point set is called the colorful union. We show that for every d ∈ N, the maximum combinatorial complexity of the colorful union of n colored points in R is between Ω(n(d−1) 2 ) and O(n(d−1) 2 log n). For d = 2,...
متن کاملColorful Subhypergraphs in Uniform Hypergraphs
There are several topological results ensuring in any properly colored graph the existence of a colorful complete bipartite subgraph, whose order is bounded from below by some topological invariants of some topological spaces associated to the graph. Meunier [Electron. J. Combin., 2014] presented the first colorful type result for uniform hypergraphs. In this paper, we give some new generalizat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of computational biology : a journal of computational molecular cell biology
دوره 20 4 شماره
صفحات -
تاریخ انتشار 2012